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1.  Introduction.  There  is  a  substantial  literature  concerning  linear 
regression  when  some  of  the  predictors  (independent  variables)  are  measured 
with  error.  Such  models  are  of  importance  in  econometrics  (instrumental 
variables  models),  psychometrics  (correction  for  attenuation,  models  of  change), 
and  in  instrumental  calibration  studies  in  medicine  and  industry.  Recent 
theoretical  work  concerning  maximum  likelihood  estimation  in  such  models 
appears  in  Mealy  (1980),  Fuller  (1980),  and  Anderson  (1984),  while  Reilly  and 
Patino-Leal  (1981)  take  a  Bayesian  approach. 

In  an  applied  context, an  investigator  may  either  overlook  the  measurement 
errors  in  the  predictors,  or  choose  the  classical  ordinary  least  squares  (OLS) 
estimator  of  the  parameters  because  of  its  familiarity  and  ease  of  use.  Certainly, 
the  methodology  of  classical  least  squares  theory  (confidence  intervals, 
multiple  comparisons,  tests  of  hypotheses,  residual  analysis)  is  considerably 
more  developed  than  the  corresponding  errors-in-variables  methodology,  particularly 
in  samples  of  moderate  size.  If  the  OLS  estimator  is  used,  what  are  the 
consequences?  ~  * — 

Cochran  (1968)  has  given  a  general  discussion  of  the  consequences  of 
using  the  OLS  estimator  in  errors-in-variables  contexts.  For  the  special 
case  of  the  analysis  of  covariance  (ANCOVA),  where  the  covariates  are  measured 
with  error,  detailed  investigations  have  been  done  by  Lord  (1960),  De  Gracie 
and  Fuller  (1972)  and  Cronbach  (1976).  It  is  by  now  well-known  that  the  OLS 
estimator  £  of  the  slope  and  intercept  parameters  8  in  such  errors-in-variables 
models  is  inconsistent;  that  is,  6  does  not  tend  in  probability  to  0  as  the 
sample  size  n  becomes  infinitely  large.  However,  in  ANCOVA  with  covariates 
measured  with  error  but  balanced  (in  terms  of  means)  across  the  design,  the 
OLS  estimator  of  the  design  effects  is  known  to  be  consistent.  This  is  shown 
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in  the  two -treatment  case  by  Cochran  (1968)  and  DeGracie  and  Fuller  (1980). 

More  generally,  Gallo  (1982)  has  shown  that  for  general  linear  errors-in- 
variables  regression  models,  certain  linear  combinations  c'e  of  the  OLS  estimator 
are  consistent  estimators  of  the  corresponding  linear  combinations  of  8.  Gallo's 
result  is  reproduced  in  Section  2  as  Theorem  1. 

Let  the  rows  of  C  be  a  basis  for  all  linear  combinations  c'e  of  e  that  are 
consistently  estimated  by  c'6.  In  the  present  paper,  it  is  shown  that  under 

a  reasonable  extension  of  the  regularity  conditions  given  by  Gallo  (1982), 

1  « 

nk  (C3-C6)  has  a  limiting  asymptotic  multivariate  normal  distribution  (Theorem  2 
of  Section  2).  This  result  does  not  require  that  the  random  errors  (errors  of 
measurement,  residual  errors)  are  normally  distributed,  but  only  that  these 
errors  are  sampled  from  a  common  population  with  finite  second  moments.  However, 
Theorem  2  does  assume  that  all  predictors  are  fixed.  In  Section  3,  Theorem  2 
is  extended  to  cases  where  some  of  the  predictors  are  random  variables. 

The  nature  of  the  limiting  normal  distribution  of  n*-  (Ce-Ce)  depends  upon 
whether  the  predictors  measured  with  error  are  random  (structural  errors-in- 
variables  models)  or  fixed  (functional  errors-in-variables  models).  In  the 
former  case,  the  limiting  normal  distribution  has  a  zero  mean  vector,  while  in  the 
latter  case  the  mear.  vector  need  not  be  zero  (and  is  a  function  of  unknown 
parameters).  A  companion  paper  (Carroll,  Gallo  and  Gleser,  1985)  uses  these 
results  to  compare  the  asymptotic  efficiencies  of  the  OLS  and  maximum  likelihood 
estimators  of  Ce  when  the  errors-ln-variables  model  is  of  the  structural  kind. 

2.  Asymptotic  Theory.  Suppose  that  a  dependent  scalar  variable  y.  is 
related  to  a  vector  f^:  pxl  of  observable  predictors  and  a  vector  f^.:  qxl  of 
latent  (unobservable)  predictors  by  the  model 


(2.1) 


y.  •  fj(i,  ♦  f^«2  *  «, 


•  1  *  1 >2, . . . ,n, 


and  that  i  1S  observed  with  error  by  x^,  where 


(2.2) 


xi  =  f2i  +  ui’  i  l»2,...,n. 


For  fixed  (fj.,  fL)  it  is  assumed  that 

(2.3)  \)’  1  1  i  i  n,  are  i .i.d. 

v  ui  ' 

with  mean  vector  0  and  covariance  matrix 


z 


To  state  the  model  in  vector-matrix  form,  let 
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(2.4) 


Y  «  V,  +  ^2  +  e’  X  3  F2  +  U. 


where  the  rows  of  E  3  (e,U)  are  1.1. d.  random  vectors  with  mean  vector  0  and 
covariance  matrix  1. 

Note.  It  is  assumed  that  all  design  (dummy)  variables  are  included  in  Fj.  This 
eliminates  the  need  for  separately  including  an  intercept  term  in  the  model. 

The  OLS  estimator  of  8  for  the  model  (2.4)  is 


(2.5) 


FjF,  F|X 


X'F,  X'X 


Consistency.  To  give  asymptotic  results  about  8,  we  need 


to  make  some  assumptions  about  the  sequence 


(2.6) 


f  3  {(fjr  f 2-j ) :  1  «  1,2,...} 


of  fixed  predictor  values.  These  are  the  following. 
Assumption  1. 


lim  n 


-1  FiFl  FiF2 

F2F1  F2F2 


A11  A12 
A12  a22 


A  ,  A  >  0 . 
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Assumption  2. 

1 

lim  n”^max  [FpF,,]  =  0, 

n-«o 

where  for  any  matrix  A  =  ((a..)),  max  (A)  =  max  |a..|. 

•  J  '  *  J 


We  will  make  extensive  use  of  the  following  results. 

Lemma  1 .  Under  (2.4)  and  Assumptions  1  and  2,  for  all  (q+1 )-^imensional 
column  vectors  t. 


n" *  (FrF2)‘  (e,U)t  -  MVN(0,(fEt)A) 


in  distribution  as  n^-.  In  particular. 


(2.7)  n” *•  (f1  ,F2)  ‘  (e-Uii2)  -  MVN(0 


.[(l.-B2^(.B2) 


]a) 


in  distribution  as  n--". 

Proof.  This  is  a  direct  consequence  of  Corollary  3.2  and  the  discussion  following 
in  Gleser  (1965) .  L 

Lemma  2.  Under  the  assumptions  of  Lemma  1, 


F-iFl 

FJXN 

= 

/  A11 

A12 

X’F1 

X'X  / 

\  a12 

A22 

+  op(l). 


Proof.  From  the  weak  law  of  large  numbers, 
n*1 (e,U) 1 (e,U)  =  l  +  o  (1)  , 


(2.3) 
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while  from  Lenina  1 , 


n"1  FJU  -  0p(n'*). 


From  these  facts,  (2.4)  and  Assumption  1,  the  assertion  of  the  lemma  directly 
follows.  □ 

The  following  theorem  is  a  restatement  of  the  result  of  Gallo  (1982) 
mentioned  in  Section  1. 


Theorem  1  (Gallo,  1982).  Under  (2.4)  and  Assumptions  1  and  2, 


where  1^  is  the  q~dimens1ona1  identity  matrix. 
Proof.  Note  from  (2.4)  that 


1 

n 


Fj(e-Utf2) 

F£{e-Ua2)  +  U’(e-U?.2) 


However,  L.uma  1  implies  that 


l 

n 


(e-Ue2)  =  0p(n“'*), 


while  it  follows  from  (2.8)  that 


iu’(e-Ue2)  =  3 j2  -  s22s2  ♦  oD{l). 


From  these  facts,  (2.5)  and  Lemma  2  it  follows,  that 


Then 

'll  '12 

■  A1 2  A22  +  }  22  - 


and  it  follows  from  (2.9)  that 
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(2.10)  c'B  ~E—  c'3  +  C' 


.  ’o 


Q  (oy2~r 22^2^ ' 


Thus, 


A11  A12 


C*6  -2-*  C'B  «  c'  Q(a]2-};22B)  *  0,  all  B,  £ 


Clearly 


-An  a12 


=  0  *  c' 


A11  A12 


Q(°l 2"Z22B)  =  0 


for  all  3,  £.  On  the  other  hand,  if 


62  =  l22  °22  ‘  T22  ("A12  A1P  Iq)c’ 


0  =  c' 


'All  A12 


'An  Ai2 


■All  A12 


C  =  0 


»  c ' 


A-1  , 
'An  '12 


since  Q  >  0.  This  completes  the  proof. 


S'  V. 


*  •  v-»  •••  <•  •* 
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Note  that 


c  * 


,-l  a 

"An  Ai2 


•  i 

=  0  «  c  =  d  [Ip,  A^j  A12],  some  d. 


From  this  fact,  it  is  easily  seen  that  the  rows  of 


C  *  ^n*  A11  A19) 


p*  11  “12' 


serve  as  a  basis  for  the  linear  manifold  of  all  c  such  that  c'e  is  consistent 
for  c'£.  This  motivates  consideration  of  the  limiting  distribution  of 


Tn  »  nk  C(e-S). 


2.2  Asymptotic  Normality  of  T  .  Rather  than  state  our  main  result 
(Theorem  2)  at  once,  we  first  derive  a  representation  for  TR  that  leads  us  to 
the  extra  assumption  needed  to  obtain  asymptotic  normality  of  Tn. 

Let 


fl 

(l-in>L9  )  =  C  -  1 


it  follows  from  (2.5)  that 


(2.11) 


n-  *Lln’L2nJ  \  W 


Lenina  3.  Under  the  assumptions  of  Lemma  1, 

Lln  ’  all  +Vn’ 

and 

Gn*  Fi<F2  -  F1  «u  «12>y) 

—  H[44‘  K-Ji'"’ 

in  distribution  as  n-x». 

Proof.  The  first  assertion  is  a  direct  consequence  of  Lemma  2  and  the  fact 


[I 


Vl2  hz'hl!  "  ' 

Note  from  (2.4)  and  the  definition  of  W^n  that 

“in  *  n  Fi(F2"Flnn  a12,r  *  n  Fi*e,u)  (-U^) )  • 

The  second  assertion  of  the  lemma  now  follows  from  this  representation.  Lemma  1 
the  first  assertion  of  the  lemma  and  Slutzky's  Theorem.  □ 

Lemma  4.  Under  the  assumptions  of  Lemma  1, 


(2.12) 


w2n  *  Is  U'(e-U(s2^))  -  a22J,] 

'  tn  F2*F2‘Fl'']la12*'i22.^’'  +  V" 


and 


L2n  W’lJ  F)<F2-F,  a^12))[Q-'« ■  (!)]-'  ♦  0 ■  (n-i). 


Proof.  Using  (2.4)  and  the  definition  of  W2r),  we  can  write  W2fJ  as  the  sum  of 
the  first  two  terms  on  the  right-hand  side  of  (2.12)  plus 


I  F2(e-U(,J2  + , ))  -  1  U'(F1,F2)| 


'All  A12 


Using  Lemma  1,  this  last  term  can  be  shown  to  be  Op(n"s),  as  asserted. 
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From  facts  about  Inverses  of  partitioned  matrices,  the  definitions  of  C 
and  L2n  and  (2.4), 


L2„-<S  W’  rSFi<F2-F,  *;!  V  *SFiU)  A„ 


where 


An]  =  H*X'X  •  X'F1  <FlFi>“1  Ho¬ 


using  Lemma  2,  It  is  easily  shown  that 


An'  *  *22.1  +  r22  *  0pO  *  D'1  ♦  «„(')■ 


Using  Lemma  1, 


n”1  F*U  *  0  (n**). 


Since  n”1  FjFj  *  a^  +  o(1)  by  Assumption  1,  the  representation  for  L2n  given  by 
the  lemma  follows  from  Slutzky's  Theorem.  D 

Using  (2.8),  Assumption  1  and  Lemma  4,  it  is  straightforward  to  show  that 
M2n  =  0p(D.  Let 


(2.13) 


Zn  “  n  Fi(F2'Fl  A11  A12^ ' 


It  follows  from  (2.11)  and  Lemmas  3  and  4  that 


(2.14)  Tn  =  Gn  -  (Aj]+op(l))  Zny  -  (A7]+o(l))Zn[Q-1  *  op(l)3“1(op(l))  +  op(l) 


A  careful  look  at  (2.14)  shows  that  for  Tn  to  converge  In  distribution 
for  all  s,  z  It  is  necessary  that  Zn  be  0(1).  Thus,  we  are  led  to  make  the 
following  assumption 

Assumption  3.  For  every  sequence  f  defined  by  (2.6), 

11m  Zn  =  lim  n‘*  Fj (F2“FlAilA12)  *  Z($ 
n-*»  n-*“ 

where  the  limit  Z(f)  may  depend  on  £. 

That  Assumption  3,  tether  with  Assumptions  1  and  2,  Is  sufficient  for 

T  to  have  a  limiting  multivariate  normal  distribution  is  clear  from  (2.13), 
n 

Lemma  3  and  Slutzky's  Theorem.  This  is  our  main  result. 

Theorem  2.  Under  Assumptions  1,  2  and  3, 

Tp  =  n*  (C3-Ct>)  >  MVN(-,\j]  Z(f h,  (n'>:n)A“j) 

in  distribution  as  n-—,  where  C  =  (I  .^Ja^), 

i  =  (-22  *  “22  1^  ^"l2  ”  ^22^2^’  - 

3.  Discussion  and  Extensions .  Theorems  1  and  2  assume  that  the  sequence 
f  defined  by  (2.6)  is  a  sequence  of  fixed  vectors.  If  elements  of  the  vectors 
( f-j i  ,f^ )  in  this  sequence  are  random  variables,  one  can  think  of  these  results 
as  being  conditional  limit  theorems. 

When  components  of  each  (f^-.f^),  i  =  1,2,...,  are  random,  a  fairly  easy 
argument  can  be  used  to  extend  Theorems  1  and  2  to  apply  unconditionally, 
provided  that  Z  y,  where  Zn  =  Zn(f)  is  defined  by  (2.13),  has  an  asymptotic 


distribution. 
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Thus,  let  Sj  represent  the  random  part  of  ( f-j ^  *^2i ^  and  1et  k  3  (s^, 1*1,2,...}. 
Distributional  assumptions  about  the  s^  yield  a  probability  measure  u(^)  over 
the  sequences  s.  Suppose  that 

A  *  {$:  lim  n"1(F1,F2)'(F1,F2)  =  a  >  0,  11m  n‘MF.  ,F,)  •  0} 

n-+~  c  c  n-*» 

satisfies 

(3.1)  /  du(j^)  3  1  • 

In  other  words.  Assumptions  1  and  2  are  satisfied  with  probability  one.  Then 
Theorem  1  shows  that  for  all  £  in  A,  all  r.  >  0, 

11m  P{[tr(C8-Ce)'(C8-C8)P  >  e|.s>  *  0. 
n->® 

Thus,  by  the  Lebesgue  Dominated  Convergence  Theorem,  for  all  e  >  0, 

lim  P{[tr(C8-C8)'(C8-C8)]“  >  c}  =  0, 
n-*» 

and  hence  C8  converges  unconditionally  in  probability  to  Cb.  This  shows  that 

Theorem  1  holds  unconditionally  (over  £). 

In  a  similar  fashion,  it  can  be  shown  that  the  representation  (2.14)  for 

Tp  holds  unconditionally,  that  Gn  in  that  representation  has  the  limiting 

multivariate  normal  distribution  described  in  Lemma  3,  and  that  G  and  Z„ 

n  n 

are  asymptotically  statistically  independent.  Consequently,  if  aj]  Zn  y  has  a 

limiting  distribution,  the  limiting  distribution  of  Tn  is  the  convolution  of  the 

limiting  distributions  of  G„  and  Z  > . 

n  1 1  n 


IS 


Note:  The  above  discussion  Is  only  a  sketch  of  the  arguments  needed,  and  skips 
over  such  details  as  measurability.  A  more  extensive  discussion  In  a  similar 
context  can  be  found  in  Gleser  (1983). 

We  will  now  follow  the  steps  of  the  above  analysis  for  some  special  cases 
of  the  model  (2.4)  which  are  commonly  adopted  in  practice.  Recall  that  if 
f^.,  i  s  1,2,...,  are  random  vectors,  the  model  (2.4)  is  called  a  structural 

linear  errors-in-varlables  regression  model,  while  if  the  fgj,  1  *  1,2 .  are 

vectors  of  constants,  the  model  is  that  of  a  functional  linear  errors-in-variables 
regression  model.  Mixes  of  these  cases,  where  some  elements  of  f2i  are  fixed 
and  some  elements  are  random,  are  also  possible.  Further,  the  elements  of  f^ 
(except  for  the  first  component,  which  is  always  equal  to  1  to  accomodate  an 
intercept  term)  can  also  be  fixed  or  random.  Let 


We  will  consider  the  following  cases: 

(a)  both  hi  and  fixed  (functional  model), 

(b)  hi  random,  f,^  fixed  (functional  model), 

(c)  h.j  fixed,  f£..  random  (structural  model), 

(d)  both  h.  and  f^  random  (structural  model). 

3.1  Both  hj  and  fixed.  Theorems  1  and  2  already  summarize  what  we 

can  say  about  this  case.  Although  Theorem  2  has  some  technical  interest,  it  is 
unfortunately  rather  useless  for  statistical  applications.  Unless  we  are  in 
the  unlikely  case  where  we  either  know  the  limit  Z(f)  or  can  consistently  estimate 

*\i 

this  quantity,  we  cannot  use  Theorem  2  to  construct  large-sample  confidence  regions 
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for  C8.  Recall  that  {f2j,  1  a  1,2,...}  Is  a  sequence  of  unknown  parameters, 
and  that  the  Individual  vectors  fg-j  In  this  sequence  cannot  be  consistently 
estimated.  Thus,  very  strong  assumptions  are  needed  to  permit  us  to  consistently 
estimate  Z(f)  (or  aj]  Z(£)y). 

3.2.  hj  random  and  f^  fixed.  Here,  we  can  assume  that  the  vectors  h^ 
are  mutually  statistically  independent,  but  must  consider  the  possibility  that 

the  distribution  of  depends  upon  f^,  1  =  1,2 .  (That  is,  the  h . ' s 

are  not  identically  distributed.)  Given  the  linear  form  of  (2.4),  it  is  natural 
to  assume  that  a  similar  linear  model  relates  h^  to  f2j.  Thus,  we  assume  that 


(3.2) 


hi  =  a  +  *  f2i  +  t. ,  i  »  1,2,... 


where  the  t. 's  are  1.1. d.  with  mean  vector  0  and  covariance  matrix  A.  We  also 
assume  that 

<3-3)  ll!  »  ,!>*»•  "!  H  i.Vw  *  “22  >  0 

n+eo  i  =  i  n-x»>  1=1 

and  that  11m  n~*  f^  *  0,  all  1.  By  letting  f^  -  f^  -  u,  a-*  a+  it >u. 


n-+» 


a22  a22  “  uu';we  can  let  u  =  0  without  loss  of  generality. 

The  strong  law  of  large  numbers  shows  that 


11!  "  X  *'  =  °’  l1"  »  X  W  ‘  # 

n-*»  i  - 1  n-x»  i  =  i 


with  probability  one.  Using  (3.2),  (3.3)  and  Theorem  3  of  Chow  (1966), 
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However,  consider  the  special  case  *  a  0.  In  this  case  the  random  parts 
hj  of  fjj  are  1.1. d.  random  vectors  Independent  of  the  *s *  an<* 


Using  Corollary  3.2  and  the  discussion  following  in  Gleser  (1965),  It  can  be 
shown  that  the  elements  of  n  *  T'F^y  have  an  asymptotic  multivariate  normal 
distribution: 

n'^T’F2Y-  MVN(0,(y'a22y)a). 

Although  we  could  Impose  the  condition  that  n”*  IM^y  *  0(1),  this  is  a  rather 
restrictive  condition,  and  still  leaves  us  the  problem  of  estimating  the  limit 

_  j 

of  n  4  1T2y  *n  stat'*stica^  applications.  Instead,  we  settle  for  a  more  restricted 
result: 

(3.4)  n*  (0,1  p_1  )(C3-CB)  •>  MVN(0,6  ) 

in  distribution  as  n-*»,  where 
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since  a’1  =  (0,Ip_j  )a^| (0, Ip ) ' .  In  this  context  («P=0) ,  it  Is  worth  noting 
that 


c  =  ^°*Ip-l^Ip*"AllA12^ 

=  (o,ip_ro), 

so  that  the  result  concerns  the  estimates  of  the  slopes  (0,Ip_-| )b^  of  the 
on  the  h.  (the  random  part  of  f^.)  in  (2.4). 

3.3  h-  fixed  and  f^.  random.  In  analogy  with  the  discussion  in  Section 
3.2,  we  assume  that 

(3.5)  f2i  =  v  fli  +  V  1  *  1,2,..., 


where  the  are  i.i.d.  with  common  mean  vector  0  and  covariance  matrix  A. 
(Here,  since  the  first  element  of  fj.  is  always  1,  there  is  no  need  for  a 
separate  intercept  term.)  Assumption  (3.5)  is  commonly  adopted  in  instrumental 
variables  approaches  to  errors  in  variables  models  in  econometrics,  and  in 
ANCOvA  with  measurement  errors  in  the  covariates. 

We  also  assume  that 


(3.6) 


i  n 

lim  l 

i  =  l 


n-Ku 


f  f '  = 

rliTli 


'll 


and  that  lim  n_i-  f^  =  0,  all  i.  Following  steps  similar  to  those  used  in 
n-«° 

Section  3.2,  we  can  show  that  (3.1)  holds  with 
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An  An*’ 


ip^ll  i^Aii^'+A 


Hence , 


A1 1 A1 2  s  *' 


Note  that 


Zn  3  n  Fi(F2"FlAliA12J  =  n  *  F1T’ 


where  T'  =  (t|,...,t  ).  Applying  Corollary  3.2  and  the  following  discussion 
in  Gleser  (1965), 


A^ZnY  »  MVN(0,a"](y'Ay)) 


in  distribution  as  rw<».  Consequently, 


(3.7)  n*  (CB-C6)->MVN(0,Ajj  [n’Zn  +  y'AyJ) 


in  distribution  as  n-^.  It  is  worth  noting  that  here 


C  =  (Ip,*), 


A  =  a22<1, 


- ( 02+> ) 


When  *f?  =  0,  there  is  a  close  parallel  between  (3.4)  and  (3.7).  Note  also 


that  in  this  case  Cfl  =  e 


21 


Even  when  <i>  t  0  (the  distribution  of  depends  on  f^),  the  result 
(3.7)  was  obtained  without  the  need  to  make  extra  assumptions  on  the  higher 
moments  of  the  common  distribution  of  the  t^ ,  in  contrast  to  our  conclusions 
in  the  case  of  Section  3.2. 

3.4  Both  h^  and  f^  random.  In  this  case  it  is  more  natural  to  make 
assumptions  concerning  (h.!,  f^),  1  =  1,2,...  .  We  assume  that  these  vectors 
are  i.i.d.  with  a  common  mean  vector  p  and  a  common  covariance  matrix  $.  The 
strong  law  of  large  numbers  now  shows  that  (3.1)  holds  with 


Let  =  (pj.pg)  and 

'll  ‘12 
‘12  22 


where  are  the  common  mean  vector  and  covariance  matrix  of  the  h^'s. 

Thus, 

-1 


-1 

ir  12 


i 


'i 


'i  'll  ri 


*12  +ylp2 


^Va\V  Cof^j) 


22 


Let  H'  =  (h^,h2,...,hn).  Then 

Zn  3  "  *  (  H?)  (f2  '  1n^u2"wl<fr11^12)  ‘  “*11*12*  * 

The  Central  Limit  Theorem  shows  that  the  first  row  of  Zn  has  an 
asymptotic  multivariate  normal  distribution.  For  the  remaining  rows  of  Zn 
to  be  asymptotically  multivariate  normally  distributed,  additional  assumptions 
on  the  higher  moments  of  the  joint  distribution  of  (hj,f^)  are  needed.  To 
avoid  such  assumptions,  we  can  assume  that 

(3.8)  3  p2~*12*llul  *  *12*ll“i  *  *i *  ^  =  ^»2*»**» 

where  the  t^'s  are  i.i.d.  with  mean  vector  0  and  covariance  matrix 

*22.1  =  *22  "  *12  *11  *12 

and  statistically  independent  of  the  h.'s.  If  we  condition  on  the  h^s,  (3.8) 
is  the  model  (3.5)  with 

*  *  (w2“*12*llpr  *12*1 1  *  ’  A  =  *22. T 

We  can  now  use  the  results  of  Section  3.2,  noting  that  with  probability 
one  (over  sequences  h^,h2,...  ) 
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Thus,  conditional  on  the  h/s, 

(3.9)  n*  (Cs-Cs)  -  MVN(0,a^1[n,En+Y'*22.1T3) 

in  distribution  as  n-*».  Using  the  arguments  given  at  the  beginning  of  this 
section  about  converting  conditional  limiting  results  to  unconditional  limiting 
results,  we  conclude  that  (3.9)  also  holds  unconditionally. 

3.5  Conclusion.  The  results  (3.4),  (3.7),  (3.9)  can  be  used  to  construct 

a 

large  sample  confidence  ellipsoids  for  Ce  based  on  the  OLS  estimator  C8  provided 
that  consistent  estimators  can  be  found  for  the  covariance  matrices  of  the 
asymptotic  normal  distributions.  It  should  be  noted  that  in  general  C$  is  a 
function  not  only  of  6,  but  also  of  a^a^,  which  need  not  be  a  known  matrix. 
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